Numerical shadows: measures and densities on the numerical range 
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For any operator M acting on an iV-dimensional Hilbert space Wjv we introduce its numerical 
shadow, which is a probability measure on the complex plane supported by the numerical range of 
M. The shadow of M at point z is defined as the probability that the inner product (Mu, u) is 
equal to z, where u stands for a random complex vector from Hn, satisfying ||u|| = 1. In the case of 
N — 2 the numerical shadow of a non-normal operator can be interpreted as a shadow of a hollow 
sphere projected on a plane. A similar interpretation is provided also for higher dimensions. For a 
hermitian M its numerical shadow forms a probability distribution on the real axis which is shown 
to be a one dimensional B-spline. In the case of a normal M the numerical shadow corresponds 
to a shadow of a transparent solid simplex in E^ -1 onto the complex plane. Numerical shadow is 
found explicitly for Jordan matrices Jn, direct sums of matrices and in all cases where the shadow 
is rotation invariant. Results concerning the moments of shadow measures play an important role. 
A general technique to study numerical shadow via the Cartesian decomposition is described, and 
a link of the numerical shadow of an operator to its higher-rank numerical range is emphasized. 
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I. INTRODUCTION 

The classical numerical range W(M) of a complex N x N matrix M is the subset of C defined by 

W{M) = {(Mu,u) lueC^.IHI = 1}. 

This concept has a long history and has proved a useful tool in operator theory and matrix analysis as well as in 
more applied areas; for a nice account of some of the lore of W(M), see [GR1997]. Here we are concerned with the 
measures or densities induced on W(M) by various distributions of the unit vector it; we use the term numerical 
shadow to refer to such densities (see section 2 for our motivation in using this terminology). Although it seems 
natural to study the numerical shadow, this subject does not seem to have received much attention until recently 
The only earlier extended account (that we know of) is the thesis [N1982]. This apparent neglect is not the only 
reason for our attempts to understand the numerical shadow better; the numerical shadow also plays an interesting 
role in quantum information theory. Applications in this area are the focus of a companion paper [DGHMPZ2], in 
preparation. Very recently a preprint by Gallay and Serre [GS2010] has appeared, dealing also with mathematical 
aspects of the numerical shadow (numerical measure, in their terminology). In several ways their development of the 
subject is parallel to our own (as presented, for example, in [Z2009] and [H2010]). In the present paper we stress 
those aspects of our work which are complementary to [GS2010], particularly those based on the moments of the 
numerical shadow. 

In this paper we work mainly with the "uniform" distribution of u over the unit sphere fijv in C N = R 2N , ie the 
probability distribution on Q N that is invariant under all orthogonal transformations of R 2N . In some cases, however, 
the internal structure of W(M) is better revealed through the use of other distributions, and these are of special 
importance for the applications discussed in [DGHMPZ2]. 

Although the numerical shadow is in general difficult to determine explicitly, this is possible in a number of interesting 
cases. Methods based on identifying the moments of the shadow measures are often effective. We also present, via 
the figures, the results of numerical simulations that display shadow densities in various other cases. 

In section 2 we treat the simple situation occurring when M is 2 x 2. Here we obtain a "real-life" shadow. In section 3 
we discuss the analogous treatment for N x N matrices M, ie we view the numerical shadow of M as the image of an 
appropriate measure on the pure states uu* under a linear map ifM- I n section 4 we show that in the case of N x N 
normal M the numerical shadow is the orthogonal projection of a well-placed model of the ( A— l)-dimensional simplex 
(with the uniform density). Thus the density for the numerical shadow is a 2-dimensional B-splinc (1-dimcnsional in 
the Hcrmitian case). 

Section 5 studies the moments of numerical shadows, yielding a key technique for the identification and comparison 
of shadows. In section 6, criteria for the equality of the numerical shadows of two matrices are obtained (in terms, 
for example, of traces of words in the matrices and their adjoints). Evidently equality occurs when the matrices are 
unitarily equivalent, but this is not necessary (if N > 2). 

In section 7, the numerical shadows are found explicitly for the Jordan nilpotents J N . Section 8 extends the techniques 
developed in section 7 to obtain explicit densities for all rotation invariant shadows. 

Section 9 introduces a useful view of the numerical shadow in terms of the (Hermitian) components Re(M) and Im(M) 
in the Cartesian decomposition of M, and the unitary matrix linking those components. Several related aspects of 
the numerical shadow are treated in that section, including a connection with the Radon transform. Section 10 
relates the numerical shadow of a direct sum to the shadows of its summands. Section 11 is concerned with numerical 
approximations of shadow densities in terms of moments and Zernike expansions. 

Finally, in section 12, we relate the numerical shadow of M to the so-called rank-fc numerical ranges Afc(M). The 
theory and applications of these ranges has been advanced vigorously since their introduction only a few years ago 
as a tool in quantum information theory (see for example [CKZ2006a, CKZ2006b, CHKZ2007, CGHK2008, W2008, 
LS2008, LPS2009, and GLW2010]). One way to describe A fe (M) is that it consists of those points (Mu.it) in W(M) 
where u may be chosen from the unit sphere in a whole fc-dimensional subspace of C^. Thus it is natural to ask to 
what extent Afc(M) may be identified as a region of greater density within the numerical shadow. Here the shadows 
corresponding to real unit vectors u play a role. 
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Let us fix some notation. The algebra of complex N x N matrices is here denoted by Mjv(C) or simply Mm- The 
adjoint or conjugate transpose of a matrix M £ Mn is denoted by M*; we consider vectors v in C N as column vectors 
and v* is the conjugate transpose. Our inner product (v,w) may be computed as w*v. Recall that the unit sphere in 
C N is denoted by 0_/v, ie 

n N = {ueC N : \\u\\ = 1}. 

The uniform probability measure on Qjy is denoted by fi. Given M € MjV) the notion of "numerical shadow of Af" 
is captured formally as the probability measure Pm on W(M) such that 

P M (S) = n{u £ n N : (Mu, u)£S}, 

for each Borel subset S of W(M). Equivalently, for any continuous function g : W(M) — > C we have 

f g(z)dP M (z)= [ g((Mu,u))dfx(u). (1) 
Jw(m) Jn N 

If Pm has a probability density (with respect to planar measure in C) it is denoted by /m- In those cases where /m 
is rotation-invariant, ie /m( z ) = /a/(M) we consider Jm as a function of r E (0, w(M)), where w(M) is the so-called 
numerical radius of M: 

w(M) = max{|z| : z € W(M)}. 
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II. THE 2x2 CASE; REAL-LIFE SHADOWS 

Here we compute the shadow density for an arbitrary 2x2 matrix M. Our method is to view this density as a 
real-life shadow of the Bloch sphere model for SI2 ■ An equivalent result was obtained by Ng (see [N1982]) by a rather 
different method. 



The Bloch sphere model sees W(M) as the genuine shadow of a hollow sphere made of infinitely thin semi-transparent 
uniform material, where in general the light would fall obliquely on the (complex) plane. Of course, this situation 
cannot quite be realized physically, but playing with a hollow plastic ball in bright sunlight may yield a good approx- 
imation. It is well-known that when M £ M 2 the numerical range W(M) is a filled ellipse with the eigenvalues of 
M as foci. The following proposition, visualized in Fig. [TcJ supplies further information in the form of an explicit 
shadow density. 

Proposition 2.1: Let E be the filled ellipse formed by W(M) and let a and b be the lengths of the semimajor and 
semiminor axes of E; then the shadow density is 
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2i^ab\J\ — r 2 

at every point on the elliptical curve bounding rE (0 < r < 1). 
Proof: Recall that we assume that u is chosen "uniformly" over {u 



(2) 



1}, ic according to the measure 



fi on SI2 ■ It is known that |wi| 2 will then be uniform in [0, 1]. This a special case of the fact that u uniform in Q jy 
implies (|ui| 2 , |it2| 2 , . . . , |uat| 2 ) has the uniform distribution in the N — 1-dimensional simplex, see |Z1999 BZ2006 



It is also important to note that for fixed u\ the relative phase of u 2 is e where 9 is uniform in [0, 27r]. We have 
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Let z — 2\u\\ 2 — 1; then z is uniform in [—1, 1] and 2ui~u 2 = y/l — z 2 e~ t6 = x — iy, so that (recalling Archimedes) 
(x, y, z) is uniform on the unit sphere. This is one way to see that the corresponding distribution on the Bloch sphere 
{uu* : )i G O2} is uniform. 



Following Davis [D1971], we compute (Mu,u) as 

tr(u*Mu) = tx{Muu*) 



tr(lM + \m 



z x 
x + iy 



For convenience we take M 



1 Q 
-1 



with q > 0. This is a harmless normalization, achieved via translation 



and rotation of M (which respects the distribution of (Mu, u)) and unitary similarity (which leaves the distribution 
unchanged). Then (Mm, u) — z + (g/2)(x + iy) = [z + bx, by) in the complex plane, with b = q/2. Consider the region 
E bounded by the ellipse centred at (0,0) with horizontal semiaxis of length a = y/l + b 2 and vertical semiaxis of 
length b. Given r 6 [0, 1], (Mu,u) lies in rE iff 



(z + bx) 2 (byf 
1 + b 2 + ~tf r 



A calculation verifies that this is equivalent to ((x, y, z) ■ (1/a, 0, —b/a)) 2 > 1 



saying that (x, y, z) lies on either 



of the spherical caps of the unit sphere that are symmetrical about the axis determined by (1/a, 0, —b/a) and have 



5 



J* 0.0 





-0.4 -0.2 0.0 

x 



0.2 0.4 



(a) Numerical shadow of matrix nonnormal ( jj J ) 
resembles physical shadow cast by the hollow sphere 
made of transparent material when illuminated by a 
light source at infinity. 



(b) Cross-section of the shadow along the real axis. 
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(c) Numerical shadow of matrix nonnormal ( § } ) 
resembles physical shadow cast by the hollow sphere 
made of transparent material when illuminated by a 
light source at infinity, but with screen not 
perpendicular to the light rays. 



(d) Cross-section of the shadow along the real axis. 



FIG. 1 



radius r. According to Archimedes (or the related formulas found in calculus texts) the relative area of these caps is 
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1 — Vl — r 2 . Hence the probability 

P{{Mu, u)£rE) = l- yjl-r 2 . 

To find the corresponding planar density, we first observe that the region (r + Ar)E \ rE corresponds to symmetrical 
rings bordering the spherical caps mentioned above. Thus the planar density will be constant on the ellipse bounding 
rE. Its value there is then given by 

,. (1 - Jl - (r + Ar) 2 ) - (1 - x/T^T 5 ) 1 
hm 



Ar->o irab(r + Ar) 2 — irabr 2 2irabVl — r 2 

QED 

This result is equivalent to the formula for the density obtained by Ng (see [N1982], page 67). He computes the 
density of (Mu, u) at (x, y) in the ellipse as 

i 

p(x,y) 



(x 2 /a 2 + y 2 /b 2 ) 

His method seems unrelated to the Bloch sphere approach worked out above. 

III. NUMERICAL SHADOWS AS LINEAR IMAGES OF THE PURE QUANTUM STATES 

It is clear that the argument of section 2 can be extended in part to cases where N > 2. The Bloch sphere is 
replaced by the set of density matrices representing pure quantum states: 

PQS N = {uu* : u £ ^In}- 

Just as before, for any «£!!„ and M £ Mn we have 

(Mu,u) = tv{u*Mu) = tv(Muu*), 

so that W(M) is the image of PQSn under the linear map ifM ■ M N — > C defined by 

if M (X)=tv{MX). 

Since each X € PQSn is Hermitian we may also write 

l p M (X)=tv(MX*) = (M,X) F , 

where (-,-)f is the Frobenius inner product on Mjv- 

Thus the numerical shadow of M may be viewed as the measure on W{M) induced by applying the linear map ifM 
to the fixed measure v on PQSn that corresponds to the uniform fi on Q^- As M varies the resulting numerical 
shadows may be regarded as a tomographic study of the measure v. Thus such detailed information as we have about 
numerical shadows (see section 9, for example, or [GS2010]) reveals much about the structure of v on PQSn- 

IV. THE HERMITIAN AND NORMAL CASES: B-SPLINES 

The standard TV-simplex A^v is defined by 

JV 

A N = {r e R N : all r k > and ^ r k = 1 }- 

fc=i 

This is an (N — l)-dimensional convex subset of R N . We say r is "uniformly distributed" over An to mean that r is 
uniform with respect to normalized (N — l)-dimensional Lebesgue measure voIn-i on A^v- 
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(a) Shadow of matrix diag(0, 1, 3) is a spline function of (b) Shadow of matrix diag(0, 1, 3, 5) is a spline function of 

degree one. degree two. 

FIG. 2: Probability density function of hermitian matrices of dimensions 3 and 4. 



Lemma 4.1: Let P A be the shadow measure of a normal matrix A G Mjv- Then for any Borel subset B of the plane 

P A {B) = Prob{r*A G B}, 

where A = (Ai , . . . , Xn)* is the spectrum of A and r is uniformly distributed over the standard A-simplex Ajv- 
Proof: Since A is normal, it is unitarily similar to diag(A). As fi is invariant under unitary transformations we have 

JV 

P A (B) = Prob{(diag(A)u,u) G B : u G n N } = Prob{(^ \ k \u k \ 2 G B : u G n N }. 

fe=l 



It is known (see e.g. [Z1999 BZ2006 ) that if u is uniform over (ie distributed according to fx) then 

r = (| Ul | 2 ,...,| UAf | 2 ) 

is uniform over the simplex Apf. QED 



With A as above, write A = x + iy where x, y G M. N . Assume first that x and y are independent vectors. Let W be a 
real matrix with x, y as its first two columns and with columns 3 to A forming an orthonormal basis for {x, y} 1 - . For 
any vector v G R N , 

v*X = a + ib {a,b gK) {W*v)i = a and (W*v) 2 = b. 

Thus for Borel BcCeI 2 

Prob{r*A G B} = vol N ^{r G A N : ([W*r) u {W*r) 2 ) G B}, 
and, in view of Lemma 4.1, the density for P A at (a, b) G W(A) is 

woZAT_ 3 {r G A N : (W*r)i = a, (W*r) 2 = b} = voIn-3{v G W* (A n ) : v\ — a,v 2 = b}. 
Let us recall the definition of an s-dimensional B-spline (from [dB1976]). 
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(a) Shadow of the matrix diag(e 2 / 3i,r , e ~ 2 / 3i7r , 1) forms an (b) Shadow of the matrix diag(l, i, — 1, —i) forms a regular 
uniform distribution supported by the triangle pyramid, 
spanning its eigenvalues 



0.5 - 



a> 0.0 




(c) Shadow of the matrix 
diag(e 2 / 5l7r , e 4 / si7r , e 6 / B " r , e 8 / 5i7r , 1). 

FIG. 3: Probability density function of unitary matrices of dimensions 3, 4 and 5. 

Definition 4.2: Let a be a nontrivial simplex in R s+fe . On M s we define the B-spline of order k from a by 

M k ,„{x u ...,x s ) = vol(af]{v e R s+k : v, = xj (j = 1, 2, . . . , s)}). 
Using this terminology we may summarize our results as follows. 



Proposition 4.3: The numerical shadow of an N x N normal matrix with eigenvalues A G C having linearly 
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independent real and imaginary parts has as density a 2-dimensional B-spline MN-2.a(a,b), where the simplex 
a = W*(An) with some W chosen as above. 

Remark 4.4: In the case where the real and imaginary parts of A are dependent (as when A is Hermitian), it is easy 
to see that the numerical shadow is 1-dimensional (a line segment, in fact) with density given by a 1-dimensional 
B-spline (compare [dB1976, Lemma 9.1]). As examples, the shadows of Hermitian matrices of size N = 3 and N = 4 
are shown in Fig. [2j 

Remark 4.5: This observation for the Hermitian case was worked out in detail by Ng in [N1982]. He also made 
the right conjecture regarding the normal case. In a sense, the normal case was earlier understood by statisticians 
studying the distribution of quadratic forms; see for example [A1971, chapter 6]; Anderson points out that some of 
the relevant ideas go back to von Neumann in the 40's. Anderson seems to discuss only real quadratic forms; thus 
the normal case corresponds to roots of multiplicity 2. 

Remark 4.6: In view of Proposition 4.3, the theory of B-splines may be applied to see that normal shadow densities 
are piecewise polynomial functions of two variables in the normal case - see Fig. [3] and of a single variable in the 
Hermitian case. The latter case is analyzed in some detail in Sec. 9. A thorough analysis of the B-spline shadow 
densities for normal matrices is also provided in [GS2010]. 



V. MOMENTS OF THE NUMERICAL SHADOW 



We denote the moments of the numerical shadow of A 6 Mm by 

v sk (A) = [ z^z k dP A {z). 

JW(A) 

Note that, since the polynomials in z and z are uniformly dense in the continuous functions on W(j4), these moments 
determine the numerical shadow uniquely. Moreover, in view of @, we have 

u jk (A)= f (Au,u) j ((Au,u)) k dfj,(u)= f {Au,u) 3 [A*u,u) k d^u). (3) 

Given A € and a multi-index a £ Nq (where No = {0,1,2,...}), we use the following notation: A" = 
A™ 1 Aj 2 • - ■ X N N , \a\ = J2i a k, Oi\ = ai!a 2 !, . . . a N \. We also use the Pochhammer symbol or shifted factorial 
(x) n = n"=i( x + 3 - 1); by convention (x) = L 



The effective evaluation of the moments Vjk(A) depends on the following proposition. 

Proposition 5.1: Given A E Mjy, let A G C N list the eigenvalues of A repeated according to multiplicity. Then 

/ (Au,u) n d[i(u) = — — — ft,„(A), (4) 
Jn N lWJn 

where h n (X) is the complete symmetric polynomial of degree n, ie 

MA) = J2 r - 

aGNj', \a\=n 

Proof: Given multi-indices a, /3, let 

Q(a,/3)= / u a {ufd^{u), (5) 



where the conjugation u is applied entrywise. Since fi is invariant under the unitary map u — > (e j6, iti, 712, . . . ,un) 1 , 
we have Q(a,(3) — e^ Ql ~' 9l ^(3(a, /3) for each real 9. Hence Q(a,f3) = unless ai = fti. Similarly for the other 
components, so that Q{a, /?) = unless a — (3. More work is required to evaluate Q(a, a): 

Q(a,a)= [ \u\^d^u) = J ^-, (6) 
JQ. N ( iv )H 
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where |it| = (\u±\, \ii2\, ■ ■ ■ , |uat|)*. A convenient trick here is to consider 



1= ( e-^^ +v ^\\{x 2 k + y 2 k ) ak dxdy. 
Jm? N \ 



As a product of Gamma-integrals we obtain / = n N a\. Integrating first over rfijv, with r 2 = Yli { x k + y 2 ), then over 
< r < oo, we find that I = ^\S 2N ^ 1 \(N + \a\ - l)\Q(a, a), where \S 2N ~ X \ denotes the (2N - l)-dimensional area 
of Ojv- Since Q(0, 0) = 1, the formula (|6| follows. 



We may assume A is in the Schur upper-triangular form, since this is obtained via a unitary similarity and [i is invariant 
under unitary transformations on C . Thus Ajj = Xj (some listing of the eigenvalues of A, with multiplicity) and 

JV 

(Au, u) = Xj \uj \ 2 + AijUjWi. 

3=1 3>i 

Aside from (J2j ^j\ u j\ 2 ) n ^ the terms of (Au, u) n are scalar multiples of expressions of the form 

a b 

n i u ^i 2 n 

k=l k=l 

where b > 1 and each jk > i^. Such an expression has the form u 1 (u) 7 u a (u)^ where, using e(j) as a temporary 
notation for the multi-index with 1 in the j— th position and O's elsewhere, 



b 

n 

k=l k=l 



Clearly a± — and for some first ko > 1 we have a ka > 0; hence j3 k > for some k < ko, so that a ^ (3. Thus 
Q(7 + a, 7 + /3) = so that such terms make no contribution to the integral over 0^. It follows that 



f {Au lU ) n dv(u) = f C£X 3 \ U] \ 2 ) n dn(u). 

JQjy </Qjv j = \ 

Using the multinomial formula and ([6]), this integral is 



-XQ(a,a)= ^ -A 



a! ^ v ' y ^ a! (JV) n 

aeN™, |a|=n aSN™, |a|=n 



QED 



It will be convenient to use the notation A (A) to denote any listing of the eigenvalues of A, repeated according to 
multiplicity. Applying Q with A replaced by sA + tA* (t, s real) and recalling ^ we obtain 

J2 {^y^^n-M) = j^-h n (X(sA + tA*)). (7) 

Moreover, the RHS of ^ may be evaluated in terms of traces of words in A and A* , using known relations |Macl995] 
among h n (X), the power sums 

ft(A) = f>i 

k=l 



(these equal tr(A J ) if A = X(A)), and the elementary symmetric polynomials 

e jW = X] K x Xi 2 . . . \ H 

l<i 1 <i 2 <-<ij<N 

(note that e,j(\) = if j > N; by convention eo(A) = 1). For n > 1 we have 

e h n ~ e\h n -x + e 2 h n - 2 • • • ± e n h a = 

(by convention ho(X) = 1) and 

ne n = pie„_i - p 2 e n - 2 H ± P«e - 

For example, /ii = pi so that Q implies 

^o,i (A) + si/ li0 (A) = jj Pl {\(sA + tA*)) = ^tr(sA + tA*) 
= i(i tx{A*)+stx{A)). 

Hence 

p lfi {A) = itr(A) and u 0tl {A) = ^tx{A*). 
Likewise lh 2 — e.\h\ — e 2 \iQ = e\h\ — |(piei — p 2 eo) = p\ — \v\ + = 5(^2 so that ([7]) implies 

t 2 u , 2 (A) + 2tsv hl (A) + s 2 u 2<0 (A) = 2 + (^tr( S A + tA*) 2 + itr 2 (.sA + iA*)) 
' (t 2 (tr(/T) 2 + tr 2 A*) + 2ts(tr{AA*) + tr(A)tr(A*)) 



N(N + 1) 
+ s 2 (tr(A 2 ) +tr 2 A)) 

(using tv(A*A) = tr(AA*)). Thus we have 

tr{A^ + t^A tT(AA*)+tr(A)tT(A*) 
V2 < o{A) ~ N(N + 1) ' " lAA) ~ N(N + 1) ■ 

and 

_ tr(A*) 2 +tr 2 A* 
"° AA) N(N + 1) ■ 



Similarly we find that /13 = ^p$ + \p\p 2 + gPi, so that (|7|) implies 

t 3 v , 3 (A) + M 2 sv h2 {A) + 3ts 2 v 2A (A) + s 3 v 3 ,o( A ) 

f -tt(sA + tA*) 3 + -ix(sA + tA*)tr(sA + tA*) 2 



N{N + l)(N + 2) x 3 v ' 2 

+ ^tv 3 (sA + tA*)) 

' (t 3 {2tv(A*) 3 + 3tr(A*) 2 tr(A*) + tr 3 A*) 



N{N + l)(N + 2) 
+ t 2 s(6tr(A{A*) 2 ) + 6tr(A*)tr(AA*) 
+ 3tr(A)tr(A*) 2 + 3tr(A)tr 2 A*) + ts 2 . . . ) 

where again the cyclicity of the trace plays a role. 
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From these calculations we obtain 

»,,o(A) = iv(iv + 1 1 )(jv + 2) (2tr(A 3 ) + 3tr(^)tr(A) + tr3A), 

viAA) = N(N+ 1 1){N + 2) (^A*) + 2tr(A M AA*) 

+tr(A 2 )tr(/T) +tr 2 A ti(A*)), 
and ^1^2(^4), 1^0,3(^4) by interchanging the roles of A and A* . 

What is not clear from the approach above is the important fact that all the moments Vjh (A) are polynomials in the 
traces of A, A* words of length at most N. One way to see this is to note that (fsb and (|9|) allow us to express h n for 



n > N in terms of pi,p2, ■ ■ ■ ,Pn- For example, when iV = 3we find that hi = fpiP3 + yjPi + \v\- 

In general then we need only compute tr(sA+tA*) k for k < N, and therefore (in view of the noncommutative binomial 
formula) we need only compute tr(Pkj(A, A*)) for k < N, where 

Pkj(x, y) = {z\Z2 ■ ■ ■ Zfc where Zi = x if i £ S, Zi — y if i ^ S}. 

SC{l,2,...,fe}, #(S)=j 



We summarize this discussion in the following proposition. 

Proposition 5.2: Given A £ Mjy, all the moments Vi t j{A) of the shadow measure (and therefore the measure Pa 
itself) are determined by the traces of {sA + tA*) k (as polynomials in s,t) for k < N. Thus they are determined by 
the values of 

tr(P kJ (A,A*)) 

for k < N. 

Remark: The cyclicity of the trace (ie tr(^4_B) = tr(BA)) reduces tr(Pkj(A, A*)) to a single term when k < 3 but 
this is not always the case. For example 

*\2 



tr (Pa, 2 (A, A*)) =4ti(A 2 (A*) 2 ) +2ti(AA*) 



The information about moments Vj^(A) that is provided by |7|) may also be encoded in the series 



s(A, s ,t, q ) = f>"^GT ("y*"~X«-i(^)) 

n=0 ' j=Q ^ ' 

00 

= J2l n hn(X(sA + tA*)) 



n=0 

(absolutely convergent for small real s, i, q). 

The following proposition provides powerful alternative forms for this series. 

Proposition 5.3: Given A £ we have (for sufficiently small s,t,q) 

S(A, s, t, q) = det" 1 (I - q(sA + tA*)) (10) 

and 

N 



fc=0 



S(A, s ,t,q) = (Y,(-q) k e k (\(sA + tA*))) ■ (11) 
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Proof: These follow from the identities 

oo N 

e 9 ,i ma) = n> + ^ + a 2 + ■ • ■ ) 

n=0 j=l 

N N 
, -1 



3=1 fc=l 



QED 



Remark 5.4: In view of ( 10 1, the shadow measure Pa is completely determined by det(/— (sA+tA*)) as a polynomial 
in s, t (q may be absorbed into t, s). 

Remark 5.5: In view of Q, the relation (11) provides another viewpoint on Proposition 5.2. 

VI. CRITERIA FOR EQUALITY OF NUMERICAL SHADOWS 

Given A,B € Mjy, we have seen in the last section that Pa = Pb 

iff 

tr(sA + tA*) k = tr(sB + tB*) k (12) 



(as polynomials in s, t) for all k < N 
iff 



for all j < k < N 
iff 



tr(P ktj (A,A*)) =tr(P fcJ (£,£*)) (13) 



det(I - (sA + tA*)) = det(J - (sB + tB*)) (14) 

(as polynomials in s,t). 

Since the uniform measure [x on f2jv is invariant under unitary transformations, Pa — Pu*au f° r an Y unitary U. It is 
natural, therefore, to ask whether the numerical shadow Pa determines A up to unitary similarity. This is the case 
for A £ M2, for example, since the ellipse E — W(A), just as a set, determines an upper-triangular form for A: A is 

(y. s 

unitarily similar to ^ ^ where the eigenvalues a, (3 are the foci of E and s is the length of the minor axis of E. The 
answer is "yes" also for normal matrices A since the eigenvalues are determined by Pa in that case (see section 4). 



More generally, however, the answer is "no" , on several levels. First of all, the measure [i is also invariant under any 
orthogonal transformation of M. 2N = C N , so that, in particular, dfi(u) = dfi(u). Thus A and its transpose A 1 , though 
they are not usually unitarily similar, always have the same numerical shadow: 

(Au,u) = u*Au = (u*AuY = u'A^u*) 1 = (tt)*A% = (A%u). 

In fact, the maps A 1— > JJ *AU and A M> U*A t U, are the only linear maps on Mm that preserve the numerical shadow, 
since they are the only linear maps that preserve the numerical range as a set (see C.-K. Li's survey [L2001]). 

Moreover, particular pairs A, B may have the same numerical shadow without being related by unitarily similarity 



or transpose. This phenomenon is somewhat clarified by comparing the trace criterion ( 13 ) for Pa = Pb with the 
analogous criteria for unitary similarity. In a 1940 paper [S1940] Specht observed that A and B are unitarily similar 
iff 

ti(w(A,A*)) =tx(w(B,B*)) 

for all two-variable words «;(•, •). Since then much work has been done with the aim of limiting the set of words 
required in Specht's criterion when matrices of a given size are involved. In [DJ2007] Djokovic and Johnson provide 
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a welcome account of recent results in this direction. In particular, the following result (see Theorem 2.4 in DJ2007) 
may be compared with (13). 



Proposition 6.1: Given A,BG Mjv, there exists unitary U such that B — U* AU iff 

ti(w(A,A*)) =tr(w(B,B*)) 

for all words w(-,-) of length < iV 2 . 



(15) 



The disparity between ( 15 ) and ([13]) certainly suggests that A and B might have the same numerical shadow without 
being simply related by unitaries. Let us see how this does occur when N = 3. In [DJ2007], Djokovic and Johnson 
refer to a result of Sibirskii: the unitary equivalence class of A 6 Ma is determined by 

tr(A),tY(A 2 ),tiiAA*),tr(A 3 ), 
tiiA 2 A*),tT(A 2 (A*) 2 ) 1 and tr(A 2 (A*) 2 AA*), 



and this set is minimal. In contrast, (13) tells us that the numerical shadow Pa is determined (when A € M3) by 

tr(A),tr(A 2 ),tr(AA*),tr(A 3 ), and tr(A 2 A*). 
Thus we expect to find A,Bd M3 such that Pa — Pb but A and B are not unitarily related. 

A class of specific examples is provided by 





"0 


X 


0" 




"0 


y 


0" 


A = 


y 








,5 = 


X 










z 










z 









Note that det(I — (sA + tA*)) = 1 — s£(|ar| 2 + \y\ 2 + \z\ 2 ) — s 2 xy — t 2 xy. Since this expression is symmetric in x, y, 
(14) tells us that Pa — Pb- Consider the choice x = 0, y = z = 1: then A has rank 1 while B has rank 2. Clearly B 



is not unitarily similar to A or to A . 

Remark: The common numerical shadow of these A, B is identified explicitly in section 7, because B is unitarily 
equivalent to the Jordan nilpotent J3. 



VII. NUMERICAL SHADOWS OF JORDAN NILPOTENTS J N 



Here we compute explicit shadow densities for certain special matrices, focusing on the Jordan nilpotent Jn, ie 
Jn € Mjv(C) with l's on the superdiagonal and 0's elsewhere. Of course, the discussion of the 2x2 case in section 2 
applies to J 2 and shows that the planar density of the shadow Pj 2 at z 6 C is /2(|z|) where 



27r(l/2)Vl~4r 2 W 1 - 4r 2 ' 

since W{J2j is a disc of radius 1/2. The shadow density for J3 can be computed by several methods but here we'll do 
it as the simplest case of a general method that exploits the moment techniques from section 5. We shall see that the 
shadow density for Jn is an alternating sum of densities supported on discs with centre at and with various radii, 
the largest being cos(it/(N + 1)). This is a striking development beyond the well-known numerical radius formula: 
w{Jn) = cos(ir/(N + 1)) (see [DH1988] for information about the numerical radii of certain matrices with simple 
structure; for more, see [HS2010]). 

Observe first that the shadow measure Pj N is certainly circularly symmetric about 0; in fact Jn and e Jn are 
unitarily similar (use U = diag(l, e t6 7 e l2e , ...)). Thus, from Proposition 5.3, we have 

V { -^s m t m v mm {J N ) = det-\lN ~ (sJ n +tJ* N )). (16) 
■'— ' m!m! 

m=0 
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We may take s = t and identify v mm (JN) via the coefficient of t 2m in det 1 (//v — f(Jiv + Jj\r))- Now the eigenvalues 
of Jjv + Jjv are well-known: 

2cos (]vTi) ( fc = 1 > 2 ,...,iv). 

[Some say that this was the first nontrivial eigenvalue problem ever solved, and that it goes all the way back to 
Cauchy.] 



Thus the RHS of (16 1 (for s = t) can be calculated explicitly. The details appear in the proof of the following 
proposition. 



Proposition 7.1: For each N > 2 and m = 0, 1, 



kit , N m ml ml 



v mm ( J N ) = Ck ( cos2 ( ^xTTT } ) 1 I A , , A^_L 



k—l \ 2 > m \ 2 > m 

where 

fe-i 2 W+1 ■ 2r kn \f I k7T \\ N ~ X 



, -,\k-l 1 ■ 21 fe7r \ ( I fe7r \V 



Proof: By (16 1, with (small) s = t, 



E ( N )m 2m,, / t s _ 
. ft Vmm\ J N) — ^ N 



m'.m 

m— 



0^(1-2*008^)) 

Now the cos(-^pj-) are the roots of the monic polynomial Cn{x), where 

1 sm(N + l)9 

(a version of the Chebyshev polynomials of the second kind). Thus 

1 1 N 

1 _ 1 \ - ak 

cvM " n?=i(* - ~tA x - cos (^tt)) ' 

where the coefficients a& in the partial fraction decomposition are given by = l/Cjy( cos ( jv+i ))" Using the formula 
for Cm (cos 9) we find that 

2 N sin 2 !' k7r 1 
^ = (-1)^ &m ^+i J . 
v ; 7V + 1 



We now have 



(2i)*nf=i((l/2*)-cos(^ 

1 f (-l)^ 1 • 2, kTT 

t N 2^ N + 1 bm ^ + i j 



^o- ! - ! j (^^((V^-cos^)) (2t)"CW(l/2t) 



= * 2 y(-l)^ 1 sin 2 (^^)y(2tcos(^^)) 3 . 
fc=i i=o 

Evidently the summed coefficients for j odd and for j < N — I are [we need not worry about how this happens!] so 
that the term in t 2m for the final expression corresponds to j = 2m + N — 1. Thus 

v-^, ,t i 2^ ,, far >/ , kn ^\ N ~ 1 , n, kir m!m! 

«w(J*r) = E(- 1 )" 1 ^ T T sin2 (^)( C0S (ivTI ) ) (4cos ( ¥TT )} 



fc=l 



(JV) 



2/» 
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Note that the terms for k and N — k + 1 are the same and that (N) m may be replaced by )ro(^2~^)ro2 2m ■ Then 
(J N ) is given by 

_JV/2j 



. , N t_i 2 W+1 . k >, . fc7r , n jv-1/ 2/ fc 71 " N\m to!to! 



(the additional factor of 2 is correct even if N is odd because then cos(-^j-) = for k = (N + l)/2). QED 
The value of Proposition 7.1 lies in the possibility of identifying explicitly those densities with moments 



to! to! 

V 2 > m V 2 / r 



2 Jm \ 2 > m 

Proposition 7.2: Suppose f{x) and (7(2) are probability densities on [0, 1] with moments 



/ x m f(x) dx = a m , / x m g(x) dx = b m . 
Jo Jo 



Then 

(i) for any b > 0, (l/b)f(x/b) is a probability density on [0, b] with moments b m a m , 
and 

(ii) a probability density on [0, 1] with moments a m b m is given by 

h(x)= f f{ S )g{-)-. 

Jx S S 

Proof: (i) With the substitution y = x/b, 



J ^/(f ) dx = f Q b m y m f(y) dy = b m a m . 



(ii) Consider independent random variables X, Y with /, g as probability densities. Then XY has moments 

E((XY) m ) = E(X m )E(Y m ) = a m b m . 
Since X, Y have joint density f{x)g{y) 1 ¥rob{XY < t] is 



f g(y) dyj dx = f(x) (J g(y) dyj dx + £ f(x) (J g{y) dyj dx, 



1 m 

for t G [0, 1]. Differentiate with respect to t to obtain the density h(t) for XY: 



f{t)-f{t)+f t 
QED 

We can now compute the density Fm{x) on [0, 1] having moments 



Si'" 



mi mi 



F N {x)dx-- , 

V 2 )m \ 2 )m 

for to = 0,1,2,... and N > 2. For any f3 > we have the beta-integrals 

to! 



/' 

Jo 



px m (l-xf- 1 dx 



for to = 0, 1, 2, . . . (use induction on to via integration by parts). For N = 2 take /3 = 1/2 to see that 

F 2 (x) = —^=. 
2vl - x 
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For N > 3 we apply Proposition 7.2(ii): Fn(x) = h(x) computed with 



Consider even N = 2b. 



[ 1 2t-l ^21-2 X2_^ds 

(€- 1)(2£- 1) 



2 

With the substitution s = 1 — u 2 we obtain 



- [ {l-sf-iis-xf-H 1 - 

J x 



ds. 



,2^-2/1 „ „,2\^-2 ttu 



F n {x) = {l-l){2i-l) £ u»-\\-x-u*Y (1 _ (l , y -y 



For odd TV = 2^ + 1 we reverse the roles of / and </ to obtain 



s 



.2^-2/1 „ „.2\^-l ^ u 



*(2*-l) / u^(l-a:-«Y 



(l-M 2 )^ 



The integrals representing Fn (x) are elementary in the sense that they may in principle be computed explicitly (using 
partial fractions, for example). In particular, 



^ du , l + yi - ^ 



F ^( x ) = / 1 2 = lo §" 

1 w va; 



o 



and 



c?u . 1 + \/l — x 



Fi(x)=3 u 2 ^=31og ^= 3VT 



o 



1 — ?i : 



In fact, there is a recurrence relation for the Fjy(x) that makes the calculation of Fjv for TV > 4 a simple task; such 
matters are discussed at the end of this section. 

Returning to shadow densities, let the planar density of Pj N at z £ C be denoted by fjy (\z\) so that 

r-2-rr r"w{JN) pw(Jn) 

v mm {Js)= I / r 2m f N (r)rdrdO = 2n r 2m+1 f N (r) dr. 

JO Jo Jo 

With the substitution x = r 2 we have 

rW 2 (J N ) 

V m m(JN) = 7T / X m f N (y/x) dx. 

Jo 

In view of Proposition 7.1 and Proposition 7.2(i), 

rw' 2 (J N ) 
7T / X m f N (y/x) dx = 

Jo 
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where Ck are as in Proposition 7.1. Thus 7t/jv(\/x) and 

i_JV/2j 



„„„2 _fc7T_ ^ V pn n2 fc7T / 

fc=l uus N+l Luh Af+1 

coincide (since they have the same moments). We obtain 

1 LAr/2j c x 2 

fN(x) = - Sky Fw ( m ,2 )> 

fe=l LOh JV+1 COb JV+1 

an (alternating) sum of densities supported on [0, cos j^j] ; in particular we have a greatly refined version of the result 

w( J N ) = cos j^pj- (= max fe cos ^). 

In summary, we have proved 

Proposition 7.3: The radial density /jv(r) of Pj N is given by 

_AT/2j 



<■ l \ 1 L V^ J / 2 JV+1 . 2 fc7T , klT , JV-3 „ . 
/jvM = - > (-1) sm 7( C0S T7 7) 

^ v ; 7T f-^ v ; 7V + 1 iV + l v iV + 1 7 y 



-.+1 iV + l v JV + l' ™ y cos 2 ^' 

fe=l JV+l 

for any TV > 2. 

For TV = 2 we see again that 

, , , ( 8 . 2 7T _i 7T 1 \ 2 1 

/2W =t Sm 3 C ° S 3 2v /l-rVco^fi = ^7r^- 

Likewise 

... 4 . 2 tt r 2 2 1 + A /T^2^ 

/sW = - sin' - F 3 ( — ) = - log -= . 

tt 4 v cos' j ' tt v / 2r 

For N > 3 the radial density combines densities on discs of several different radii. For example, 

, , . 32 / . 2 TT 2 n 1-. C r2 \ • 2 27T 2 271" „ / T 2 s\ 



1 / 8r 2 



8r 2 



3 + \/5 y v 3-\/5 

We shall see that the functions Fn are the basic building blocks for many circularly symmetric numerical shadows. 
Hence it will be worthwhile to explore their properties more thoroughly. To this end, we introduce the following 
hypergeometric series: 

in this context H is often denoted by 2-^1- Here we may assume that the parameters a, 6, c are real and that 
c 7^ 0, —1, —2, Note that the series converges absolutely for \t\ < 1 since it has the form ^ djP where 

\d J+l /d 3 \ = \(a + j)(b + j)/((c + j)(j + l))\ ^ 1. 

Recall the Gauss summation formula, which tells us that the series converges also for t — 1 whenever a, b > and 
c — a — b > and that, in such a case, 

H(n h -r-U- TiC - a - b)T{c) 

F(a,6 )C ,l)- r(c _ a)r(c _ b) . 
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Given p > 0, 6 > and k = 1, 2, . . . , let 



for < x < 1. 



(17) 



Proposition 7.4: The function denned by (17) is a probability density on (0, 1) with moments 



to! (k) T 



x m G(x) dx = , 



(m = 0,l,2,...). 



Proof: Evaluating the beta-functions Jq x m (—x)^ +s 1+J dx as 

m! + 5 + j) _ m\r{P + S)(/3 + 8) j 



T(P + S + j+m + l) r(/? + S)[fi + S) m+1 



+J 



ml (0 + 5)j 



(p + S) m+1 (P + 5 + m + l)j 



we see that 



= to! r(/3 + i)r(^ + fc) ~ {5) 3 {fi + i-k) 



Using the Gauss summation formula, we obtain 

to! T(/3 + l)r(<5 + k) r(m + fc)r(/3 + S + m + 1) 

rCS + <$)(*:- 1)! (/3 + i) m +i ' r(/3 + 1 + m)r(5 + k + to) 

to! T(/3 + l)r(<5 + fc) (A; — 1 + to)! T(p + 5 + to + 1) 



r(/3 + s + to + i)(fc - 1)! r(/3 + i)C8 + i) m r(<5 + *)(* + fc), 

to! (fc) ro 
(/3 + 1 W<5 + fc) m ' ^ 

Taking fc = 1, fj = (N - 2)/2, 5 = (iV - l)/2 we see in particular that 



r(f)rm, 



.AT- 1 N - 2 



F N (x) = ~ V2 /~ V |r -;N- 

r(iV-|) 2 ' 2 ' 2' 



(18) 



Several useful recurrence relations will follow from the following general recurrence for H. 
Lemma 7.5: If a, b are real and b > 1 then 

(a - i a - 1; b; t) - H (a - 1, a - |; 6 - 1; t) 

(26 + l-2o)(o-l) TTI 1 , , s 
2b(b^l) - -tH(a,a- -;b+l;t). 

This lemma may be verified by a careful comparison of the terms involving P +1 (j = 0, 1, 



Using (18) and invoking the lemma with t = 1 — x, a = (N + l)/2, b = N — |, we obtain the recurrence relation for 
F N+2 {xT\N>2): 



N + l 



N+2 



(19) 
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In fact, then, each Fn(X) has the form a]\r(x)F2(x) + &jv(x)i*3(x) for certain polynomials a^jfo^r. 



We extend the definition of Fn(x) by setti ng it equal to for x > 1; this is the natural continuous extension except that 
F2(x) = (2\/l - x)^ 1 t o° as x f 1. From (18 1 it is clear that for iV>3we have F N (x) -> as x fjj h ence ajv(l) = 
for AT > 3. We may also examine the behavior of the functions F^r at 0: i*3(x) = log(l + \/l — x) — g logs j" 00 
as x 4- 0, whereas -F2(x) — > |. On the other hand (18) shows that, for AT > 3, Fn(x) tends to a constant times 
H((N — l)/2, (N — 2)/2;N — 3/2; l) as x 4- 0; the Gauss summation formula tells us that this limit is +00 (since 
c — a — b = 0). Thus 6jv(0) > for A^ > 3, and F N (x) grows like — logx as x | 



VIII. ROTATION-INVARIANT SHADOWS 



Here we shall see that the methods of section 7 extend to determine explicit densities for all rotation-invariant 
numerical shadows. These are shadows of A £ Mjv such that A and e A have the same shadow for all real 9. 
Characterizing such A in terms of moments is easy: Vjk(A) = whenever j =/= k. More elusive are characterizations 
directly in terms of A. 



Simple examples are provided by the "superdiagonal" matrices: ie A such that = unless j = i + 1. For such A 
we actually have e A unitarily similar to A: let U = diag(l, e l6 , e l2e , . . . ); then U*AU = e A. The Jordan nilpotents 
Jjv are special cases of these superdiagonal matrices. 



More generally, consider the incidence graph G(A) of A £ M^: vertices are {1,2, . . . , A^} and i,j are joined by an 
edge iff a,j ^ 0. The interesting case in this context is when G(A) consists of disjoint chains (no cycles are allowed; in 
particular, A has zero diagonal) . One can see that this condition is equivalent to requiring that A have zero diagonal, 
have no more than two nonzero entries in each cross-shaped region formed by the A:-th row and the A:-th column, 
and that G{A) have no cycles. 



Proposition 8.1: If G{A) consists of disjoint chains, then A and e z6 A are unitarily similar (so that A has rotation- 
invariant shadow). 

Proof: Consider the unitary U — diag(u) where uj k = e zke for each chain 

h -> 32 -> > ik -> > 3k 

of G(A) (it does not matter which orientation of the chain is chosen). Set Uj = 1 for any j that does not occur in any 
of the chains that make up G(A). Note that 

(U AU)j k j k+1 — e a<j k j k+1 e *■ + -* = e &j k ,j k+1 - 

Since other entries of A are 0, we do have U*AU — e l0 A. QED 



This proposition applies, for example, to superdiagonal A as well as to strictly upper-triangular A that are "subper- 
mutation" matrices, ie have at most one nonzero entry in each row and in each column. 



The next proposition notes that A with rotation-invariant shadow must be nilpotent, so that it is unitarily similar to 
a strictly upper-triangular matrix (Schur form). 



Proposition 8.2: If A £ Mm has rotation-invariant numerical shadow, then all eigenvalues are 0. 

Proof: Putting t = in Q, we see that h n (X(A)) = (n > 1); indeed, this is the case whenever v n> o(A) = 0. From 

tand (|9j) we conclude that p n (X(A)) = for n > 1. Thus Yli P(^k) — for any polynomial p(x) with p(0) = 0. 
ppose \i occurs with multiplicity m. Let 

p(x) =x JJ (x- Aj); 

then 



JV 

= Y,P(^k)^mX l (Aj — Xj), 

1 Aj^Ai 
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so that Xi = 0. QED 

Proposition 8.3: The matrix A £ Mm has rotation-invariant numerical shadow 
iff 



iff 



(i) det(/ — (sA + tA*)j is a function of st; 



(ii) A is nilpotent and tr(P kj (A, A*)) = (- < j < k < N). 



Proof: In view of (10), (i) is equivalent to v k AA) = for k ^= j. To see that (ii) follows from rotation-invariance, 
invoke Proposition 8.2 and apply (13) with B — e 10 A to obtain 



tv(P ktj (A, A*)) = e^ k -^ e tr(P kj (A,A*)). 



(20) 



When j 7^ k/2, this cannot hold (for all 9) unless tr(P k ,j(A, A*)) = 0. For the converse, note that (20) is automatic 
when j = k/2 and that nilpotence ensures that both sides of (13) are zero also when j = k. For j < k/2, note that 



tr(P k , j (A,A*)) = tv((P ktk _ j (A,A*)Y)=tT(P ktk _ j (A,A*)). 



QED 



When N = 3, either (i) or (ii) easily implies that the upper-triangular form of A £ M 3 with rotation-invariant shadow 
is 

x y 
z 


where at least one of x,y,z is zero. For example, the only condition in (ii) is that ^(P^^iA, A)) = 0, ie that 
tr(A 2 A*) = 0, and one easily computes tr(A 2 A*) — xyz. Note that A and B = e %B A are unitarily similar. One 
can appeal to Proposition 8.1 to see this or use Sibirskii's list of words (mentioned in section 6): all the traces are 
automatically the same for nilpotent A and B except that ti(A 2 A*) = tv(B 2 B*) requires tt(A 2 A*) = 0. 

For N = 4 it is perhaps more convenient to use (ii) to identify those A having rotation-invariant shadow. Let the 
upper-triangular form of A be 

a b c 
x y 
z 
.0 0. 

The only conditions in (ii) when N = 4 are tr(P 3i2 (A A*)) = and tr(P 4j3 (A, A*)) = 0, ie tr(A 2 A*) = and 
tr(A 3 A*) — 0. Computing these traces we find that A has rotation-invariant shadow iff 



axb + (ay + bz)c + xyz = and ax 2 zc = 0. 



(21) 



Remark 8.4: Although the earlier examples of A with rotation-invariant shadow were also unitarily similar to e %e A, 
the analysis (above) of the 4x4 case shows that this is not necessary. If A and e A are unitarily similar we must have 
tr(A 3 (A*) 2 ) = 0, ie ax 2 z(ay + bz) = 0, and this does not follow from (21 1 (eg take c = 0, a = b = x = 1, y = -1/2, 
and z — 2). 



If A G Mn has rotation-invariant shadow, the relation ( 10 ) simplifies: 



y t 2m (^)a™ (A j = det -i (/ _ 2tReA) 
' £ — ' m! rrv 



m=0 



(22) 
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(for all sufficiently small real t), where Re A is the Hermitian (A + A*)/2. If A i, ... , Xk are the nonzero eigenvalues 

(real) of ReA, the RHS of (22 1 is (nf=i(l - 2iA fc ))~ ; since the LHS is a function of t 2 , these eigenvalues come in ± 
pairs. We may assume that Ai , . . . , X p are the positive eigenvalues of Re A so that the spectrum of Re A is 

(Ai, A2, . . . , X p , 0, — Ai, — A2, • • • , — Aj,), 

where has multiplicity N — 2p. Note that p > 1 unless A = Ojv, since Re A = implies that A is skew-Hermitian 
and Proposition 8.2 then implies that A = 0. We may therefore write (22 1 in the following form: 



^ ml ml mm{ ' ~ n? , (1 - 4t 2 A 2 ) ' 



(23) 



The methods of section 7 extend most readily to the case where Ai, . . . , A p are distinct (as they are for A = Jn, where 
p = l^j and Aj = cos( j ^^ 1 )). The following more general proposition replaces Proposition 7.3. 

Proposition 8.5: If 7^ A E Mm has rotation-invariant shadow and the positive eigenvalues of Re A are the distinct 
Ai, . . . , A p then the planar shadow density at each z with \z\ — r is given by 



A 



2(p-2) 



(24) 



where the function Fn is computable as in section 7. 

Remark 8.6: To see that Proposition 7.3 is a special case of Proposition 8.5, recall from the proof of Proposition 
7.1 that 



9 N k-TT I 

[ } iV+T Sm ^W+l' = C^cos^j)) 



where Cn(x) = — cos(j^j-)). Thus 



(-1) 



fe-i 



N + 1 Sm2 (iV + f) UiKjKNjM^Ml) ~ C0S (l^T)) 
When Aj = cos(-^q-j-), j = 1, 2, . . . , N, the positive values are Ai, . . . , X p with p = j. 



1 



(25) 



Suppose first that N is odd; then p = (N — l)/2 and for k < p we have 



A 



2(p-2) 



,N-5 



^li<j<P,j^k( x l ~ x j) Ili<j<p,jjtki x k - Xj){X k + Xj) 



JV-5 



ni 



<i<v,i+k 



X n 

(Afe - Aj)(A fe - A w+1 _j) 



X k 5 ' ( X k — 0) • (Afe — Ajv + i_fc) 



ni 



<j<N,j^k 



(Afe - Aj) 



since A w+i = 0. Thus, in view of (25) 



A 



2(p-2) 



Hl<j<p,j^k( X k X j) 



Af- 5 • A fe ■ 2X k (-l) k - 1 ^- sin 2 (^^) 
fe k k\ > N + 1 v iV + 1 ; 



sin 2 (^^)Af r - 3 



and Proposition 7.3 follows from Proposition 8.5. The argument for even N is similar. 
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Proof of Proposition 8.5: Since \{, . . . , \ p are distinct, 



n? =1 (*-A?) ^(*-Al) ' 



E 



where 



With x = l/At\ the RHS of (23) becomes 



rwA 2 fc -A?) 



1 A 6 fc 



which we may write as 



k=l 



Comparing terms involving t 2m with ( 23 ) we see that 



TO!?n! ^j, A m^{m+p-l) 



In terms of the radial density / we have 



k=l 



J I 2 2 ' m fe=i 

so that (in view of the moments that -F/v was designed to have) 



2(m+p-l) 
fc 



it / x m f{y/x) da: = ( / x m F N (x) tfa) V M 
Jo Jo fe=1 

= E^Af" 1) ((A|r f x m F N {x)dx) 



Applying Proposition 7.2(i), 



,2(p-l) / fe m 1 r , ,. , \2 



x m ^F N (x/Xi)dx 



7T / a. m /(V5) da: = ^ 6 * A 
fe=i 

/•oo P 

■^0 7 1 



k=l 



Since all moments coincide, 



QED 



/(VS) = iE^Af- 2 V JV (x/A|). 



k=l 



Remark 8.7: Whether or not Ai, . . . , \ p are distinct, (23 
Thus B = ®fe =1 2Afc J2 ®0n-2 P has the same shadow as A 



shows that the shadow measure depends only on the A&. 
oecause the positive eigenvalues of ReB are also Ai, . . . , X p . 



All rotation-invariant numerical shadows are obtained as shadows of the simple superdiagonal matrices 
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where Ai, . . . , A p > 0. For example, A = J3 has the same numerical shadow as 



B 



y/2 0" 





Here we have another simple example of a pair of matrices with different ranks but the same shadow (compare the 
discussion at the end of section 6). 

One way to deal with the case of repetitions among Ai, . . . , X p is to follow the method of Proposition 8.5 but with the 
necessarily more complicated partial fraction decomposition. Suppose the distinct values are \x\, . . . , \x n and that pn 
occurs with multiplicity ki', then p = ki and det _1 (/ — t(A + A*)) is 



n ki — 1 

E\ " a ij 
^ fl-4f 2 «?)**-J' 



(26) 



for certain constants a,- 



Remark 8.8: The are functions of the eigenvalue data. Computationally effective expressions for these functions 
are available: see [Hnl974, pp. 553-562]. 



Let RN,k{y) be defined by 

r(f)r(^±i) 



-(l-y) N -i- k H 



N+l 



N 



r(iv-|-fc)(fe-i)! 

with the understanding that RN.k(y) = for y > 1. In view of Proposition 7.4, 

ml (k) m 



-k, — -k:N - --k;l-y), 
2 2 2 ' 



y M R N Av) dy 



(27) 



(28) 



2 > m 



Proposition 8.9: If ^ A e M n has rotation-invariant shadow and the positive eigenvalues of KeA are distinct 
/ii, . . . , n n where \ii has multiplicity ki, then the planar shadow density at each z with \z\ = r is given by 



(29) 



i=l i=0 



where a« are the constants occurring in ( 26 ) . 
Proof: From ( 23 1 we obtain 



E 



,2m ( N hm, 



ki-1 



=1 7=0 v 



71 fej — 1 OO / , . * 

( a « — ^ — ( 4i Mi) 

2 — 1 i=0 771=0 

where we have used the binomial theorem to express (1 — 4t 2 fif) ki ~^ (for small t). Comparing coefficients, 

, n ki — 1 

7 ' 



fci-1 



E ( J2 a ^ fa 



2 ml (ki-j) 7 



(K) (N+l) 
i=l j=0 1 2 Ml 2 A 
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In terms of the radial density f — Ja, we have 



77 / ^/(Vx) = V ( V an ( M ?) ro / 2/ m ^, fcl 

JO - -, V • n JO 



(recall (27) and (28)). With the substitutions x = fify, the RHS becomes 

n ki-1 



Because all moments coincide, 



i=l j=0 



1 



K Jo 



X \ 
1 ' R N t ki-j\^) dx ) 



and (291 follows. QED 



x 

~5j I ' 



i=l j=0 



Remark 8.10: For example, if all have the same value fj,, ie n = 1, /ii = /i, fci = p, the model matrix is 

A = 8? 2^J 2 8 0at_2 P 



and the only nonzero in (26) is a\ q = 1. Thus 



11 r\ 

In particular, if A = 0^ 2^i J 2 (i.e. iV = 2p) we have radial density 

11 r 2 ■ 

f{r) = -^i? 2p ,p(^ 



Recalling (27), we see that 



r(p-i)(p-i)i 



5 //f 1 - 1 



Thus 2/J.J2 has radial density 



(p - 3X1 - *r 1 = (p - ^)(yr^) 2p - 3 . 



11, L, r 2 ,„_3 



7T /i^ 



r(p-o)(i-^r s - 



iV 



Remark 8.11: There is a useful recurrence relation for the functions i?jv,fc. Apply Lemma 7.5 with a — , 
b = N — k — h, t = 1 — x to see that for N > 2k > 2 we have 



fc + 1, 



RN+l,k(x) 



N 



(N - 2)(N-2k) 



((2N -2k- 3)R N , k {x) — (N — 1)(1 — x)R N -i, k (x)). 



In using this recurrence relation to compute Rn,r one would start with Rik.k and R2k+i,k- In Remark 8.10 we saw 
that R2k,k{x) = (k — |)(Vl — x) 2k ~ 3 ; it may also be shown that 

W*) - ^^((-x)^ 1 io g ( 1 + ^V pfcoo^/r^), 



where Pk{x) is an explicitly computable polynomial of degree k — 2. 



2G 



Further insight into the case of repetitions among the X%, . . ■ , X p may be gained by considering the limit of (24) from 
Proposition 8.5 as some of the (initially distinct) Afc coalesce. This procedure is legitimate in view of the models 
A = Q)P 2X/.J2, which always have rotation-invariant shadows. In this approach the theory of divided differences 
plays an important role. Recall that, given a function g : [a,b] — > K and distinct yi,...,y p G [a, b], the divided 
difference 

r n g(Vk) 

^,..,y P ] = g n ^_ %) . 

We shall appeal to the following facts about such divided differences (compare chapter 4 of [CK1985]): 
g[yi, ■ ■ ■ , Up] is invariant under permutations of the y k \ 

g[yx, ■ ■ ■ ,y P ] = (30) 

yi - Vp 



if g is p — 1 times continuously differentiable on [a, b] then 



]im{g[yi,...,y p ]: all y k -> y } = ^ ■ (31) 



Setting yj — l/Xj in Proposition 8.5, we find that 



/(r) = i(-ir 1 (n%)5[y 1 



Vp\ j 

1 



where g(y) = FN(r 2 y). In this approach we see that if the distinct positive eigenvalues of Re^4 are fj,%, . . . ,fi n with 
multiplicities ki, . . . , k n , then the radial density f(r) for the numerical shadow of A may be computed as 

(_-i\p-l " 
f(r) = ^^(I]^)" 1 L(h, . . . , k n ), 

7T 

1 

where p = J2i h, 

L(ki, ...,k n ) = \im{g[y!, . ,y p ] :yj -> for i G (32) 
and the Ji partition {1, 2, . . . ,p} with #( J,) = 



The relations (30) and (31) provide us with a sort of L-calculus; for example, 

ran m _ g^Wti) _ r^-M^V//^) 
Mfc '°'---' 0) - (fc-1)! __ (fc-1)! ' 

and 

L(fci, . . . , fc n - 1) - L(fci - 1, . . . , k n ) 

Li\K\, ■ ■ ■ , K n ) — J J • 

Using those relations repeatedly, we find that 

fei-l „ 2 j 



L( kl , ...,fcn)=E(E I fi)) ( 33 ) 

for certain constants fiij. Again (compare Remark 8.8), the fiij are functions of the eigenvalue data. 



Summarizing, we have the following alternate method of computing /^(r). 
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Proposition 8.12: If ^ A £ M n has rotation-invariant shadow and the positive eigenvalues of KeA are distinct 
fix, . . . , fj, n where ^ has multiplicity fcj, then the planar shadow density at each z with \z\ = T is given by 

/W = ^— (I] ^T'EiE^T^ir 2 /^)), (34) 
1 i=i j'=o ^' 



where p = fcj and /?y are the constants found in ( 33 1 



Remark 8.13: A comparison of Propositions 8.9 and 8.12 suggests a relation between R^,k and the derivatives ' . 
Indeed, if 

A = ®\ 2 J 2 © N -2 P 

we have n = 1, k\ — p, fix — 1, atifi = 1, and 

i.e. /3i iP _i = 1 and all other = 0. The two forms for the radial density /a tell us that 

r 2(p-l) 

i.e. 



r 2(p-l) . 



R Nik (x) = (-l)^ 1 ^^ Fj^-^Cx). (35) 

This relation between the Rpf,k and the derivatives of (= Rn,i) nray also be obtained directly by using the identity 

H(a, b; c; t) = (l- t) c ~ a - b H{c - a,c- b;c;t). 

Remark 8.14: A study of the behavior of the radial density /a(x) (when ^ 4 £ Mn has rotation-invariant 
shadow) near x = reveals that it has dominant singularity x^ 1 logx there (recall that p is the number of positive 
eigenvalues of KeA, counted with multiplicity) unless N = 2p, in which case Ja{x) is analytic near x = 0. 

IX. NUMERICAL SHADOWS VIA THE CARTESIAN DECOMPOSITION 

In this section we discuss aspects of the numerical shadow related to the so-called Cartesian decomposition of a 
matrix A into its Hermitian components KeA and ImA = Rc(— iA). For example, we investigate the shadow of a 
possibly nonnormal matrix by means of projections onto lines in C. These projections have interpretations as shadows 
of Hermitian matrices and can also be thought of as Radon transforms of the shadow. We discuss how the eigenvalues 
of the sections are involved in the analysis of the map fijv — > C taking u to (Au,u). We remark that, in general, 
the shadow measure of a nonnormal matrix is absolutely continuous with respect to area measure on C = M 2 (see 
[GS2010]). 

A. Marginal densities 

Recall that the numerical range of a Hermitian matrix is real and the density of the shadow is a nonnegative spline 
function, straightforward to express in terms of the eigenvalues. This fact can be exploited by means of a type of 
Cartesian decomposition. 

Recall that for A £ Mm we define the real part of A by 

Re,4= I(A + A*). 
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Thus ReA is Hermitian, and A = RcA + iRc (— iA). We will be concerned with the more general Re (e 10 A) with 
— 7T < 9 < 7r; then A can be expressed as 

A = e l0 Re (e^ 6 'A) + ie l0 Re (-ie~ ie A) . (36) 
For — 7T < 9 < 7r let Xk(0) be the eigenvalues of Re(e~ l0 A), labeled so that 

Ai(0) < A 2 (0) < ... < Ajv(fi). 
Then for each u £ V, N we have Ai(0) < Re(e~* e (Au, u)) < Ajv(0) arid 

A J -(fl + 7r) = -A JV+ i_ j (fl) (1<j<JV). 

A matrix with the property that X\(9) < X 2 {9) < • • • < Xm(9) for < 9 < 2tt is called generic in the paper of 
Jonckheere, Ahmad and Gutkin [JAG1998]. 

We relate the shadow of Re (e~ t0 A) to a marginal density of Pa- Write the shadow measure (LPa (z) = Pa {z) dm 2 (z) 
(where dm 2 is the Lebesgue measure on C = K 2 ). Recall: if / (a;, y) is a density function on K 2 with compact support, 
then the marginal density along the x-axis is fx (x) — f (x, y) dy. Suppose g (x) is a continuous function; then 

/oo />oo />oo 

g (x) fx (x)dx= / g (x) f (x, y) dxdy. 

-oo J -co J —oo 

Thus the moments, x n fx (x) dx equal E [X n ] with respect to the density /. Now replace x by xcos6> + ysin# = 
Re(e~ l0 z) for some fixed 9. The line orthogonal to Re(e~ t6 z) = is Re (ie~ 10 z) = 0. Let u — Re (e~ t0 z) ,v = 
Re (ie~ 10 z); then z = e 10 (u — iv) and so the density of the shadow of Re (e~ 10 A) is the marginal density of Pa for u: 

/oo 
p A (e 10 (u - iv)) dv. 
-oo 

This is exactly the (2-dimcnsional) Radon transform of pa evaluated at (u, 6) ,u gl, — n < 9 < n. We may restate 
this as follows: suppose g (u) is real and continuous for u e E; then E [g (£/)] with respect to P^e^-™ A) lii 

f g(Rc(e-' (Aw 7 w)))d^w)= f g ( \ (e^ 6 z + e w z)) p A (z) dm 2 (z) 
Jn N Jw(A) \ z J 

/oo />oo 
g («) / PA (e lf> (u - iv)) dvdu = E[g (Re (e^ Z))] , 
-oo J —oo 

where the latter expectation is with respect to Pa- 

Remark: The Radon transform can be inverted to recover the shadow of A from the shadows of Re (e~ A), 
—it < 9 < 7r. There are practical algorithms, used in X-ray tomography, which produce approximations to the inverse 
transform by using a finite number of angles 9 (also see (also sec [Hel984, Ch. 1, Sect. 2]). The Radon transform 
approach is worked out thoroughly in [GS2010]. 

For A e M N let £_ A {s, t) = dct (/ - sA - tA*) and for a Hermitian matrix H let £ H ( r ) = det (/ - rH) ("^" suggests 
"characteristic"). For a power series h, [sH k ~\ h(s,t) denotes the coemcient of sH k in h(s,t), and [r n ] h(r) denotes 
to coefficient of r™ in h (r), j, k, n = 0, 1, 2, . . .. 

Recall from Proposition 5.3 that the moments of A can be obtained from £a, 

v jk (A) := / z>z k dP A (z) = [sH k ] U (s, ty 1 . 

Jw(A) K^)j+k 

The central moments of a probability distribution are also of interest. Let tua = jjtrA, then E [Z] = tua- The central 
moments can be computed by expanding the integrand in 

v% (A) := f (z- m A y (z - mj) k dP A (z) , 

JW(A) 
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or by using the shifted matrix A — tuaI- 
Lemma: For A e Mjq and c G C, 

£a- c /(M) = (l + sc + tc) N £ A 



1 + sc + tc 1 + sc + tc 



Proof: Indeed, 

U-d (s, *) = dct (7 - s (A - cl) - t (A* - cl)) 



= dct ((1 + sc + tc)I-sA- tA*) = (1 + sc + tc) N det (i - 



1 + sc + tc 



QED 

It is clear that the shadow of A — cl is a translate of Pa ■ Thus 

We consider the (one-dimensional) moments of Re (e~* e A) . 
Proposition 9.1: For n = 0, 1, 2, . . ., 

/oo I / -| ^ \ — 1 

_ « b ^(.-«a) («) = H Za ^re"" -re* J . 

Proof: The integral 

r « n <%(.-^) («) = i / (^ e z + e ie z) n dP A (z) 

J-oo y ' 1 JW(A) 

= ^E(;)e*- 2 '»^ ! [^-']a(M)-' 



i! 

(N) n 1, >^\2- '2 



QED 

That is, the moments of ^ 3 R, e (e- ie A) can ^ e obtained from £R e (e- i9 A) ( r )- Furthermore, 



N 



Here are the basic quantities associated to Pji e ( e -ie A y 
The mean of Pji e ^ e -ie A ) i s 



1 1 W 

— ( e "*tM + e ie tvA*) = Re (e^ e m A ) = ^ £ Xj (9) . 



3 = 1 
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The variance of Pj\gr e -ieA) ^ s 

2 



= 4N( N + l) ^ 2t9tY ( {A mAl)2 ) + 2tr ^ - m ^ 7) (A * W)) 
+ e 2i V((A* -fru/) 2 )). 

Let tr ((A - m A /) 2 ) = ae*^ with a > 0; then the variance of P^ e ^ e -io a) IS maximized at 6 = | and minimized at 
= | ± f • There is a relation with the 2-dimensional variance of Pa, namely, 

/ \z- m A | 2 dP A (z) = 1 tr ((A - m A J) (A* - mj/)) . 
The central moments of Re (e~ l ° A) can be obtained from 

SRe(e-«»A)-Re(e-«»rrM)/ ( r ) = ( x + Re (e~* 8 ™ A ) r) ^ £ Re(e - i9A) ( 1 + rRc (" e -i8 myl )) • 

B. The shadow of a Hermitian matrix 

The density function for Pjj is simple to find, given the eigenvalues of a Hermitian matrix H. Suppose H is not scalar; 
then H has at least two different eigenvalues and the shadow is absolutely continuous on R. Let dPn {x) = ph (x) dx. 
The following is the basic fact. 

Lemma 9.2: Suppose 1 < m < N and A ^ 0. For n = 0, 1, 2, . . . 



rn 



( N ^A 1 ^ [ X x n x m - 1 (X-xf- m - 1 dx= ( ^X n (A>0); 
V ™ J Jo (N) n 

(-A) 1 ^ /° x n (-x)™- 1 (x - Xf-" 1 - 1 dx = j^X n (A < 0). 



' N ~ ^ {-Xf- N ^ (a; _ Af— 1 = M 

To express the density functions for all real arguments we use the notation 

x + = max (x, 0) , 

with the convention that x° + — 1 for x > and = for x < 0. Thus the density in the first part of the lemma equals 
m( N m l )X 1 - N x^- 1 (A - xf^™- 1 for 

N M 

Suppose H is Hermitian, not a multiple of I, and (r) = (1 — rAj) = (1 — r^i) m ' , where {/Ui, . . . , ^m} is the 

i=l i=l 

set of distinct nonzero eigenvalues of H and Y^nLi m i — N. By hypothesis, H has at least two different eigenvalues 
(each rrii < N). There are are unique real numbers < i < M,l < j < rrii such that 

= g g (T^j 7 = g g ^' S ^ ^ r "' M < n^rnTR ■ 

Thus for n = 0,1,2, ... 



/oo Mm, , , 

-oo i=l ,=1 ^ >ri 
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By the lemma, the density function of Ph is 

PH ( x )= v\~ N Pijj { j 1 ) (ft - + 

Ml >0i=l \ J / 



C. The critical curves in W(A) 

Suppose that in some interval 9\ < 9 < 9 2 the eigenvalues of Re (e~ l ° A) are pairwise distinct, and there are 
eigenvectors ip^ (8) so that 

Re (e~ ie A) ^ (6) = Xj (9) ^ (9) , 

where 

V> (j) (9) =1 (1 < j < N). 
The image {(A^ (9) , (#)) : #i < 6> < 6> 2 } is called a critical curve (see [JAG1998,Theorem 5, p. 238]). 



Lemma 9.3: Suppose H (9) ,ip (9) , A (9) arc differcntiable functions on 9\ < 9 < 9 2 such that H (9) is Hcrmitian, 

^d9 J 



V> (9) e n N , X (9) e K, and if {9) $(0) = X(9)ip (9); then f Q X (9) = {{j-H (0))i/> (9) , i/>(0)). 



Proof: Write X(9)=tp (9)* H (9) ip (9) and differentiate to obtain 

% A (9) = V, (9)* (±H (0)) V> W + ^ (V- (*)*) H (0) ^ W + 1> (OT H (9) ^ (9) 
= 1> (OT (JgH (0)) tp (9) + X (9) (jL (9)*) 4>{9)+^ (9)* ^ (0)) 

because ^ (<?)* ^ (0) = 1. QED 

Proposition 9.4: For 9\ < 9 < 9 2 the critical curve satisfies 
(AV^W^^W) = (°T M U) (0) = e w (Xj (9) + iX'j (9)). 

Proof: Let H (9) = Rc (e- lB A) = (cos (9) A + {sm6)A 2 , where A x = \ (A + A*) and A 2 = ±. (A - A*). Thus 

^ (9)* Ail>W> (9) = ^ (9)* A^V) (9) + (9)* A 2 ^ (9) . 
By definition and the lemma 

V> (i) (0)* ((cos 9) A l + (sin 9) A 2 )^ (9) = Xj (9) , 

and 

^ (9)* (- (sin0) A x + (cos 9) A 2 )^ ] {9) = Xj (9) . 
These equations are easily solved to establish the formula. QED 

The analyticity of Xj (9) is shown in [JAG1998, Lemma 2, p. 240]. 

To get an idea of the structure of critical curves one has to distinguish the generic and non-generic cases. If A is 
generic then Xj (9 + it) = -Ajv+i-j (9) for 1 < j < N, the curve Cj := {e 10 (Xj (9) + iX'j (9)) : < 9 < 2tt} agrees 
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with Cjv+i-j (as a point-set) so there are |_^2~H critical curves (see [JAG1998, Theorem 13, p. 244]); the outside C\ 
is the boundary of Aa- Example 9.5.3 below is a generic 3x3 matrix. 

When using numerical techniques for solving the characteristic equation for some number of angles (for example 9 = 
jn/m, j = 0, . . . , m) the value of A' (9) can be computed as follows: let p (9, A) = det (A/ — Rc (e~* e ^4)) , differentiate 
the equation p (9, X(9)) — to obtain 

^p(9,\(9)) + \> (9) ^p(9,X(9)) = 

(so the value of A (9) determines A' (9) except possibly for isolated points where J^p (9, A (9)) = 0; this indicates 
repeated roots which do not occur in the generic case). 

In the non-generic case the same critical curve can arise from different eigenvalues: let 9 be an angle for which the 
eigenvalues are all distinct and ordered by Ai (#o) < • • • < Ajy ($o); consider each Ai (9) as a real-analytic function in 
9 and extend it to the interval 9 < 9 < 9 + it. Because this is the non-generic case the curves A, (9) may cross in 
the open interval (a finite number of times by analyticity) . Form a set-partition of {1,2,..., N} by declaring i and 
j equivalent if A^ (9 ) = —Xj (9 + 7r); the relation is extended by transitivity. The equivalence classes correspond to 
distinct critical curves. There may be only one class; consider Example 9.5.2. In this case the boundary of is 
the convex hull of the outside critical curve (from the class containing 1). 

In the situation of radially symmetric shadows (see section 8) the critical curves are circles centered at the origin. 



D. A geometric approach 



Any matrix can be expressed as a sum of two matrices with orthogonal 1-dimensional numerical ranges. For a fixed 



with — | < 9 < | we can write 



A = e id Re (e^A) + ie l ° Re (e-^^A) 

Let U u U 2 eU (JV) satisfy U{ Re (e- ie A) U x = B u U% Re (e-^ e+7T ^A) U 2 = B 2 where B U B 2 arc diagonal matrices, 
such that (Si) . . = Aj (9) and (B 2 ) . • = Xj (6 + §) for l<j< N. Then for V & Sl N 



^*Ail> = e l0 ^*AnP + ie ie ^*A 2 i) = e^^U^U^ + ie t6 ^*\J 2 B 2 XJ* 2 ^. 
By the unitary invariance of the range (and the shadow) we may replace (generic) ip by (generic) Uiip. Thus 

JV N 

(U^T A (U^) = e ie *i (0) + ie M ]T A, {o + \ ) | {U* 2 U^) 3 



(37) 



The value remains unchanged if U 2 U\ is replaced by U = D 2 U 2 U\D\ where Di,D 2 are arbitrary diagonal unitary 
matrices (in M N ). For example, choose Di,D 2 so that Uij > and Uj t \ > for 1 < j < N. Thus the numerical 
range and shadow can be interpreted in terms of a mapping from 

Tjv-i [U] := | ((l^l 2 ) J =1 > (l^l 2 ) J =1 ) : ^ e Qn ) c An x An 



to C. Every vector ( \ipj\ ) appears as a first and as a second component of a point in T/v-i [U]. The unitarily 

invariant measure on i7jv induces a measure on T^^i [U], and the shadow of A is the image of this measure under the 
map 



(m 2 ) N =i , (i^-i 2 h- e ie j2 (a, (9) m 2 +^ \w) 
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The case N — 2 can be explicitly described. Let 

U 



COS V sin t>o 
sin O — cos 6*o 



e" V2 sine/i 



with < 0q, 0i < | and — 7r < 0i, 02 < i". It suffices to consider £/ of this form (0 O is fixed). Then 



and 



Thus Ti [U] is 



{( 



l^i 



\Uil>, 



1 1 



f f 

2 + 2 



cos 26>i cos 20 o + x sin 20i sin 20 o cos (</>i — </> 2 ) 



,2 1 1 1 



2 2 



cos 2#i cos 20 o — - sin 20i sin 20 o cos (</>i — (f> 2 ) . 



1 f 



2 2 



2 2 



1 f 



f f 



+ o*l .o-^ (*l) ho + o^ (01, 02, 0) , X - ^*2 (01, 02, ^) : 



2 2 



2 2 



O<0i < |,0 < < tt}, 

where x\ (0i) = cos20i and 

X2 (01 , 0o , 4 1 ) — cos 20i cos 20 o + sin 20i sin 20 o cos (f>. 

As expected (compare section 2) this forms an ellipse (including the interior). Changing coordinates we transform 
to the square {(xi,x 2 ) : —1 < Xi,x 2 < 1}; then T\ [U] maps to {(£1,2:2) : x\ — 2xix 2 cos20 o + x 2 < sin 2 20 o }. In the 
degenerate normal case this reduces to the interval {(xi,xi) : —1 < x\ < 1}. 

The invariant measure on fi 2 is 7^2 sin0i cosQidOidfyidfa. This is mapped to the measure 

I _ 1 

— (sin 2 20 o — x\ + 2x\x 2 cos 20 o — x 2 .) 2 dx\dx 2 . 



E. Examples 

Example 9.5.1 Let 

/o 1 0\ 
. 10 

1 ~ 0001 • 

\0 0/ 

Then£ Al (s,t) = l-3st + s 2 t 2 and ^ Re(e _ i9Ai) (r) = l-|r 2 + ^r 4 . The eigenvalues of Re (e'^A^ arc | (±l±\/5), 
independent of 0, labeled so that Ai < A 2 < < A 3 < A4. Thus the density for Re(e~ l6 A{) is given by 




In fact the shadow has circular symmetry, as discussed in sections 7 and 8. The critical curves are z = \ (l + v^) e v 
and z = \ (v/5 - l) e ld , < < 2tt. 



34 



Example 9.5.2 Let 



Then 



and 



A* = 



/O 1 1 1\ 

11 

1 

vO 0/ 



^4 2 ( s , t) = l-6st- Ast (s + t)- st (s 2 + st + t 2 ) 



tRe(e-*<>A 2 ) W = 1 + r - r 



1 1 

cos ( 

4 2 



1 — r — r 



1 1 

4 + 2 



cos 6 



The eigenvalues of Re (e 10 A 2 ) are | ± cos |, — \ ± sin |. The eigenvalues are — ^, — ^, — ^] at 9 = and ±| ± ^ 



S> 0.0 




-0.5 



-0.5 0.0 



1.0 1.5 




-0.5 0.0 



1.0 1.5 



FIG. 4: Numerical shadow for matrix A 2 of size N = 4 and the corresponding critical lines, 
at 9 = In the range — | < < | we have 



sin - 
2 



and so — ^ — | sm || — Re (e l0 ip*A2ip) < \ + cos | for -0 G Sl^. The triple eigenvalue —5 at = results in a 
pronounced peak in the density for Re(A2): 

P2(x) = 6 (~ + + 9 Q + x) (-x) + + ~ Q + x) (-x)° 



3 /3 



The matrix A 2 is non-generic and there is only one critical curve which has two cusps as shown in Fig. [4j In this 
example the boundary of the shadow is the convex hull of the critical curve, so the line segment [(— 1 — i)/2, (— l + i)/2] 
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is a part of the boundary. One representation of critical lines is z = e l6 (| + cos | — § sin |) , < 9 < 4tt. The line 

— 19 i 5^/5 

segment joining ~ 1 2 ~ - ' to ~ 1 9 + ' t is part of the boundary of Aa 2 - The cusps are at — . 



Example 9.5.3 Let 



Then = I (—1 + i), 



'0 1 1 
-1, = I i 1 

,0 -1, 



U 3 (s,t) = l + (l-i)s+(l + i)t-i (s 2 - t 2 ) - 3st - st ((2 - i) s + (2 + i) t) , 

U 3 -m A3 ! (S, t) = l- fst - l -(s 2 - t 2 ) + A ((1 + i) s 3 + (1 _ i} t3) 



.sf 



((8 + i)s+(8-i) i), 



and 



sm a cos W r 



1 1 , . 

sin 9 - - cos 9 } r 



^Re(e-«"A 3 ) (0 = 1 + (cos - sin^) r - + sh. „ ) . ^ , ^ 

The variance of -PR^e-^As) ec l ua l s ^ (13 + 2 sin 20). The eigenvalues of Re {e~ lB A-$) can be found approximately, 



1.0 - 



0.5 - 



0.0 - 



-0.5 





FIG. 5: Numerical shadow for matrix A 3 of size N = 3 and the corresponding critical lines 

For 9 — the eigenvalues are 



l+vTT 



1 

" 2 ' 



-1+yTf 



for = and 



or analytically by the classical formula. ..... .. — . .... , 

[—0.6715,0.2647, 1.407] (rounded) for 9 = | . The matrix is generic; there are two critical curves: one is the boundary 
of Aa 3 and the other is a triangular curve with three cusps. A comparison of the numerical shadow for this matrix 
and its critical lines is presented in Fig. [5J Since N = 3 the density for Re (e" i( %) is piecewise linear: 



P 3 (x) = (-x)° + [x + 



1 + VV7 



-2(-a0° 



y/Vf-l 



3G 



X. DIRECT SUMS (BLOCK DIAGONAL MATRICES) 

Concerning the direct sum A © B (block diagonal matrix) of matrices A, B, it is well-known that 

W(A ®B) = corw{W(A) U W(B)} 

(see, for example, [B1997, Exercise 1.3.1]). In our context it is natural to ask how the numerical shadow of A © B is 
distributed over conv{W(A) U W(B)}. Here A and B may be of different sizes - see an example presented in Fig. [6] 
We consider then A © B £ Mjy with A £ M n and B £ M m , so that n + m = N. Given u £ Vl^ (distributed according 



to [i, as usual), let u — v\ © v 2 where v\ £ C" and v 2 £ C m ; then ||wi|| 2 
a beta-density given by 



1. It is known that t = \\vi\\ has 



lit) 



{n + m -1)1 x _ , 
(n-l)I(m-l)! 1 ' 



(t £ [0, 1] 



(38) 



From this one can deduce that the shadow measure Pa©s is an "(n,m)-beta mixture" of the shadow measures Pa 
and P B . Compare [GS2010, section 2.2]. 
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-1.2-1.0-0.8-0.6-0.^0.20.0 0.2 

x 

FIG. 6: Shadow of block diagonal matrix ( ~^ [j ) © ( i ) 



Another version of this result relates the densities corresponding to A, B, and A © B: 

Proposition 10.1: If pa(z),pb(z) are the shadow densities for A, B, then the corresponding density p for A © B is 
given by 



p(z)= f q{t)U t- 2 PA ((z-w)/t)(l-ty 2 p B (w/{l-t))dw^dt, 



(39) 



where q(t) is as in (38). 
Proof: For u £ f2jv we have 



{{A © B) it, it) — (Avi, vi) + (Bv 2 , 1*2) 



= ||wi|| 2 (^Ui,Ui) + \\V2\\ 2 {BU2,U 2 ), 

where u.j = Vj/\\vj\\. Note that u x £ Q n , u 2 £ r2 m , ui,u 2 are stochastically independent, and they have the 
corresponding uniform distributions over f2„.f2 m . 
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Hence ((A © B)u,u) = tZ\ + (1 — t)Z 2 where Z\ and Z 2 are independent complex random variables with densities 
Pa(z) and pb(z). Thus 

p( z ) = / q{t)g(z,t)dt 
Jo 

where g(z,t) is the density of the independent (for each fixed t) sum tZ\ + (1 — t)Z 2 . This density is given by the 
usual convolution formula 

9(z,t)= / 9i(z-w)g 2 (w)dw, 
Jc 

where gi is the density of tZ\ and g 2 is the density of (1 — t)Z 2 . If a complex random variable Z has density h(z) 
with respect to area on C, then tZ (where t € M) has density t~ 2 h(z/t). Hence g\{z — w) = t~ 2 pA((z — w)/t), 



32 H = (1 - t)- 2 p B (w/(l - t)) and (39) follows. QED 



XI. ZERNIKE EXPANSIONS 



Given our methods for evaluating the moments of shadow measures (see section 5), it is natural to construct 
orthogonal polynomial approximations using Zernike polynomials. These provide one way to generate pictures of 
specific numerical shadows. 

The complex Zernike polynomials Z mn (z, z) are orthogonal for area measure on the unit disk {z G C : \z\ < 1}. They 
can be defined by 

Z mn (z,z) = z— 7 {m Xu Tl ^ (m ^ n) 

^ (m- 3)1 {n- 



Zmn (z, -z) = i Xu (-tf ^ n ~ j ( m < 

(m- 3)1 {n- 



and satisfy the orthogonality relations 



^ ^ Z mn (z, z) Z k i (z, z)dm 2 (z) = 



/| Z |<1 m + n+1 

Suppose / (z, z) is continuous on the disk and has coefficients 

fmn-= / / f (z,z) Z mn (z,z)dm 2 (z) ,m, n = 0, 1,2, ... 
•/ J|*|<i 

then 

1 

/ = -y^y^(w + n+l) fmnZ-mn, 

with convergence at least in the I/ 2 -sense. As is typical of Fourier expansions, the convergence behaviour is better for 
smoother functions /. If / is real then f mn = f nrn . 

Suppose A is an Ax A matrix whose numerical range is contained in the unit disk (otherwise work with B — ciA+cqI, 
with Ci > so that tr(i?) = and the range of B satisfies the boundedness condition). We may use Proposition 5.3 
to determine the moments of the shadow Pa (and we write dPA (z) = pa (z) dm 2 (z), so that pa is the density). Thus 

777 I77 ! -• 

z m z n dp A (z) = [ S m n u (s, ty 1 , 

|*|<1 l iV Wn 
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where £a (s, t) — det (7 — sA — tA*) and [s m i™] g (s, t) denotes the coefficient of s m t n in the power series expansion of 
g centered at (s, t) = (0,0). 

ft is then straightforward to compute the Zernike coefficients of the density: 

(PA) mn = / Z mn (z, z)pa (z) dm 2 (z) 

J\z\<l 

= E Tim { - iy t^" 1 "] u ' 

1=0 ■'' ^ 'm+n-2j 



for m > n (and {p A ) nm — {VA) mn )• As an approximation, one may compute {p A ) mn for all (m, n) with m + n < M 
for some M (say fO or 20). To write our formulas in real terms with z = x + iy (and x 2 + y 2 < 1 ), let 

min(m,n) _ 
Qmn (u) = ]T i Xu Jj .;,., (-l)^ min(T "'" ) - J . 

^ (m-j)!(n-j)!j! 

Note the trivial identity ab + ab = 2ReaRc6 — 21malm6. Thus the partial sum for pa can be written as: 

LM/2J 

Y (2j + l)(p A y jj Q jj (x 2 + y 2 ) + 

3=0 



+ 2 E R ° (( X + ' l Vy) E ( 2n + 3 + X ) RC ((P^)n+J,n) Q"+i,n 



M / L(M-i)/2j 

2 Ue^ + i^') £ 

j=l \ n=0 
M / L(M-j)/2J 



2 ' J/ 2 ) 



- 2 E Im ((* + i ») J ) E t 2 " + J + X ) Im ((P^)n+J>) 0n+,> + ?/ 
j=l \ n=0 

(The factor 7r has been ignored; it is merely a change of scale), ft is a matter for experimentation to produce useful 
graphs for a given matrix. The polynomials tend to wiggle close to the edge of the disk; the graphs can not be expected 
to precisely show the boundary of the numerical range, but they do indicate the behaviour of p A in the interior. 



XII. NUMERICAL SHADOWS AND THE HIGHER-RANK NUMERICAL RANGES 

The rank-A: numerical ranges, denoted below by Afc, were introduced c. 2006 by Choi, Kribs, and Zyczkowski 
as a tool to handle compression problems in quantum information theory. Since then their theory and applications 
have been advanced with remarkable enthusiasm. The sequence of papers [CHKZ2007,CGHK2008,W2008,LS2008], 
for example, led to a striking extension of the classical Toeplitz-Hausdorff theorem (convexity of W{M))\ all the 
Afc(M) are convex (though some may be empty), and they are intersections of conveniently computable half-plancs 
in C. Among the many more recent papers concerning the Afc(M), let us mention [LPS2009,GLW2010]. 

Given a matrix M € M N and k > 1, Choi, Kribs, and Zyczkowski (see [CKZ2006a,CKZ2006b]) defined the rank-/c 
numerical range of M as 

Afe(M) = {A e C : 3P e P k such that PMP = AP}, 

where Pk denotes the set of rank-fc orthogonal projections in M^. ft is not hard to verify that Ak(M) can also be 
described as the set of complex A such that there is some fc-dimensional subspace S of C N such that (Mu, u) = A for 
all unit vectors in S. In particular, we see that 



W(M) = Ai(M) D A 2 (M) D A 3 (M) D 
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012345 01234 



x x 

(a) Shadow of diag(0, 1, 3, 5). (b) Shadow of diag(0, 1, 3, 5) with respect to real vectors. 

FIG. 7 

This point of view also suggests that these higher-rank numerical ranges should be visible as regions of higher 
density within the numerical shadow of M. This idea is borne out, to some degree, by examining shadow densities 
of various matrices. In Figure (7^a), for example, we see the (one-dimensional) shadow density of the Hermitian 
diag(0, 1,3,5) - a spline of degree 2. Here it is known that A2(M) = [1,3]; while the density is unimodal, there are 




-0.5 0.0 0.5 1.0 -0.5 0.0 0.5 1.0 



x x 



(a) Shadow of unitary matrix in Mg. (b) Shadow of unitary matrix in M5 with respect to real 

vectors. 



FIG. 8 
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values in [3,5] that are greater than some in [1,3]. If real unit vectors are used in such experiments, the higher-rank 
numerical ranges often seem to be revealed more clearly. ; compare Figure Wtb) ■ 

A similar phenomenon is seen in Figure [8j Here M is a unitary matrix in M 5 and it is known that A 2 (M) is the 
inner pentagon (with interior) formed by lines joining the non-adjacent eigenvalues in pairs. The shadow density is 
unimodal, but A%{M) is only seen distinctly in Figure |8^b) , where only real unit vectors in u G are use to generate 
the values (Mu, u) . The distinction between shadows based on complex vs real unit vectors is a reflection of the fact 
that the latter follow a Dirichlet distribution (with parameter 1/2) rather than our usual measure fi. See [BZ2006]. 
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